Time-frequency distributions for automatic speech recognition

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Time-frequency distributions for automatic speech recognition

The use of general time-frequency distributions as features for automatic speech recognition (ASR) is discussed in the context of hidden Markov classifiers. Short-time averages of quadratic operators, e.g., energy spectrum, generalized first spectral moments, and short-time averages of the instantaneous frequency, are compared to the standard front end features, and applied to ASR. Theoretical ...

متن کامل

Optimizing Time-Frequency Distributions for Automatic Classification

An entirely new set of criteria for the design of kernels (generating functions) for time-frequency representations (TFRs) is presented. These criteria aim only to produce kernels (and thus, TFRs) which will enable more accurate classification. We refer to these kernels, which are optimized to discriminate among several classes of signals, as signal class dependent kernels, or simply class depe...

متن کامل

Time-Frequency Features For Speech Recognition

Time-Frequency Features For Speech Recognition by James G. Droppo III Chair of Supervisory Committee Professor Les E. Atlas Electrical Engineering Conventional speaker independent, continuous speech recognition systems are built upon assumptions that are, in general, not met. This dissertation focuses on one deficiency in particular, that the non-stationary speech signal is modeled as a single ...

متن کامل

Automatic Classification of Positive Time- Frequency Distributions

A method of performing automatic classification of positive time-frequency distributions is presented. These distributions are computed via constrained optimization, minimizing the cross-entropy of the distribution subject to a set of constraints. An algorithm for clustering using cross-entropy as the distance measure between vectors was derived by Shore and Gray [14]. We apply this method to t...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Speech and Audio Processing

سال: 2001

ISSN: 1063-6676

DOI: 10.1109/89.905994